Towards more natural synthetic speech

نویسنده

  • Pilar Manchón Portillo
چکیده

This article reports the results of two experiments in which factors such as duration, amplitude and noise are manipulated, in order to achieve more natural utterances in synthetic speech. The participants were native speakers of English, instructed to judge the naturalness of the different versions of utterances generated throughout the manipulations. The results indicate that there are signif icant individual preferences, as well as classification principles other than conventional ones. There is evidence to believe that further research in this area will render positive results in the search for naturalness. The same principles could be applied to search for naturalness in the prosodic structure of the synthetic utterances. Advancement in this area will surely render improvements in Spoken Dialogue Systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of synthetic and natural Mandarin visual speech: Initial consonants, single vowels, and syllables

Although the auditory aspects of Mandarin speech are relatively more heavily-researched and well-known in the field, this study addresses its visual aspects by examining the perception of both Mandarin natural and synthetic visual speech. In perceptual experiments, the synthetic visual speech of a computer-animated Mandarin talking head was evaluated and subsequently improved. Also, the basic (...

متن کامل

The Temporal Delay Hypothesis: Natural, Vocoded and Synthetic Speech

Including disfluencies in synthetic speech is being explored as a way of making synthetic speech sound more natural and conversational. How to measure whether the resulting speech is actually more natural, however, is not straightforward. Conventional approaches to synthetic speech evaluation fall short as a listener is either primed to prefer stimuli with filled pauses or, when they aren’t pri...

متن کامل

A Wavelet-Based Technique Towards a More Natural Sounding Synthesized Speech

This paper presents a wavelet-based technique to increase the quality and naturalness of LPC based synthesized speech signals. The proposed method is based on wavelet decomposition. We first obtain the wavelet coefficients, and then the variances of the wavelet coefficient at the last four scales (correspond the higher frequency region) of the synthetic speech are replaced by the original varia...

متن کامل

Sampling-Based Speech Parameter Generation Using Moment-Matching Networks

This paper presents sampling-based speech parameter generation using moment-matching networks for Deep Neural Network (DNN)-based speech synthesis. Although people never produce exactly the same speech even if we try to express the same linguistic and para-linguistic information, typical statistical speech synthesis produces completely the same speech, i.e., there is no inter-utterance variatio...

متن کامل

Pitch accent type matters for online processing of information status: Evidence from natural and synthetic speech∗ AOJU CHEN, ELS DEN OS AND JAN

Adopting an eyetracking paradigm, we investigated the role of H*L, L*HL, L*H, H*LH, and deaccentuation at the intonational phrase-final position in online processing of information status in British English in natural speech. The role of H*L, L*H and deaccentuation was also examined in diphonesynthetic speech. It was found that H*L and L*HL create a strong bias towards newness, whereas L*H, lik...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Procesamiento del Lenguaje Natural

دوره 29  شماره 

صفحات  -

تاریخ انتشار 2002